A highly scalable dense linear system solver for multiple right-hand sides in data analytics
نویسندگان
چکیده
We describe PP-BCG, a parallel iterative solver for the solution of dense and symmetric positive-definite linear systems with multiple right-hand sides suitable for MPPs. Such linear systems appear in the context of stochastic estimation of the diagonal of the matrix inverse in Uncertainty Quantification and the trace of matrix products in statistical analysis. We propose a novel numerical scheme based on the block Conjugate Gradient algorithm combined with Galerkin projections and describe its implementation. We test the method on model covariance matrices from uncertainty quantification, where the solution of the linear systems is typically used to estimate the diagonal of the matrix inverse. Numerical experiments on an MPP illustrate the performance of the proposed scheme in terms of efficiency and convergence rate, as well as its effectiveness relative to block Conjugate Gradient and the Cholesky-based ScaLAPACK solver. In particular, on a 4 rack BG/Q with up to 65536 cores using dense matrices of order as high as 0.5×10 and 800 rhs, PP-BCG is 2-3 times faster. Keywords-Linear systems, multiple right-hand sides, block Conjugate Gradient, Galerkin projections, distributed memory environments, uncertainty quantification
منابع مشابه
New variants of the global Krylov type methods for linear systems with multiple right-hand sides arising in elliptic PDEs
In this paper, we present new variants of global bi-conjugate gradient (Gl-BiCG) and global bi-conjugate residual (Gl-BiCR) methods for solving nonsymmetric linear systems with multiple right-hand sides. These methods are based on global oblique projections of the initial residual onto a matrix Krylov subspace. It is shown that these new algorithms converge faster and more smoothly than the Gl-...
متن کاملTrading Replication for Communication in Parallel Distributed-Memory Dense Solvers
We present new communication-efficient parallel dense linear solvers: a solver for triangular linear systems with multiple right-hand sides and an LU factorization algorithm. These solvers are highly parallel and they perform a factor of 0.4P1/6 less communication than existing algorithms, where P is number of processors. The new solvers reduce communication at the expense of using more tempora...
متن کاملA parallel linear solver for multilevel Toeplitz systems with possibly several right-hand sides
A Toeplitz matrix has constant diagonals; a multilevel Toeplitz matrix is defined recursively with respect to the levels by replacing the matrix elements with Toeplitz blocks. Multilevel Toeplitz linear systems appear in a wide range of applications in science and engineering. This paper discusses an MPI implementation for solving such a linear system by using the conjugate gradient algorithm. ...
متن کاملAn Augmented Incomplete Factorization Approach for Computing the Schur Complement in Stochastic Optimization
We present a scalable approach and implementation for solving stochastic optimization problems on high-performance computers. In this work we revisit the sparse linear algebra computations of the parallel solver PIPS with the goal of improving the shared-memory performance and decreasing the time to solution. These computations consist of solving sparse linear systems with multiple sparse right...
متن کاملOn Techniques to Improve Robustness and Scalability of a Parallel Hybrid Linear Solver
A hybrid linear solver based on the Schur complement method has great potential to be a general purpose solver scalable on tens of thousands of processors. For this, it is imperative to exploit two levels of parallelism; namely, solving independent subdomains in parallel and using multiple processors per subdomain. This hierarchical parallelism can lead to a scalable implementation which mainta...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016